
THIS WILL INSURE THAT THE 
WILL BE ERROR FREE. 

IF YOU HAVE ANY FURTHER QUESTIONS, PLEASE CALL: 
MARK SPENCER 703-308-4212 




ATTN 



Cf\n>wt\ i^t- » ■ — ■ — 

, ., p, CASES- PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WH.CH WERE INSERTED BY PTO SOFTWARE 

Please adjust your right margin to .3. as this wil. prevent "wrapp.ng . 



Wrapped Aminos 



3 O Incorrect Line Length 

4 Misaligned Amino Acid 

Numbering 

5 Non-ASCII 



Variable Length 



7 Patentln ver. 2.0 "bug" 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



10 Use of n's or Xaa's 

(NEW RULES) 



The amino acid number/text at the end of each line "wrapped " down to the next ..ne. 
Th s may occur If your fi.e was retrieved in a word processor after creabng ... 
Please adjust your right margin to .3. as mis wi.l prevent -wra P p.ng . 

t,» n„ eS require mat a I n - - ™* 72 characters in .ength. This Includes spaces. 

Sequence^, __ conta* n's or Xaa's which represented more than one residue. 
As per the rules, each n or Xaa can only represent a single residue 
pteSl present me maximum number o, each residue having vanable .ength and 
Indicate in the <ix) feature section that some may be m.ssmg. 

a *?->n> <?23> section to be missing from amino acid 

to the subsequent amino acid sequence. 

"r/oS~™^Oo « „ Md*. — «U-« CH^CT 6R ,ST,0S-, 

(xl) SEQUENCE DESCRtPTIONrSEQ ID NO:X: 
This sequence Is Intentionally skipped 

P,ease aiso adjust the "(Hi) NUMBER OF SEQUENCES:" response to indude me skipped sequences,. 
Seouence(s) __ missing. If intend, p.ease use me toliowing format for each skipped seouenc, 
<210> sequence Id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Use of <220> to <223> is MANDATORY ^ wh ich residue n or Xaa represents. 

In <220> to <223> section, please expla.n locabon of n or Xaa. and wn.cn 



11 Use of <213>Organism 

(NEW RULES) 

12 Use of <220>Feature 

(NEW RULES) 



13 Patentln ver. 2.0 "bug" 



Sequence(s) . 



are missing this mandatory field <x its response. 



Pleas . do not use "Copy to D.sk" funct.on of ^ence UsUng, 

f„e. resuming in missing mandatory numeric idenUfiers and responses 
.nstead. p.ease use "File Manager" or any other means to copy We to floppy d.sk. 
AKS-Biotechnology Systems Branch- 5/15/99 
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DATE: 10/28/1999 

PATENT~APPLI CATION US/09/02 9 , 042A 

Input Set: I029042A.RAW 



PAGE : 1 «K«!^;H5^ /(1 , / oa..O«2» Tim! 14.00,36 



This Raw Listing contains the General 
information Section and those Sequences 
containing ERRORS. ^ 



Does Not 
Corrected Diskette 



Comply 
Needed 



? 



W- 



1 
2 
3 
4 

5 
6 
7 
3 



«!!., Ki„, sun-voun,,. Ki„, Kee-Kc»,- Tae-Han, <^ 

_ n^npHon svstem usmq Avian Cells * / « ^ 



<110> Kim, Sun-Young; *-liu, mw~, , ,, 

<120> Het^ologous Protein Production System using Avxan Cells 

<130> 

<140> US/09/029 / 042A 

<141> 1998-05-15 J^^TT 7vi 

<150> PCT/KR96/00145 A QiJ Jj^M^^ / w 

<151> 1996-08-23 
<160> 11 



ERRORED 




<212> DNA i _/ *""' 

^GTCGCTCCCT CTGOGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT ,00 ^ 



GTGACAGCCG AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG 



AATATCACGG ^^S^^ 
AGGGCTTCAG G^clSTcCCAO^TCC AGGAACCTGG CACTTGGTTT 
GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCT 
GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 
TCCTACGGCC TGTGGGCCAG GGCCAG^GC CTTCAGGGAC CCTTGACTCC 
CCGGGCTGTG TGCATTTCAG ACGGGCTGTG CTGAACACTG CAGCTTGAAT 
GAGAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 
GATGGAGGTG AGTTCCTTTT TTTTTTTTTT TCCTTTCTTT TGGAGAATCT 
CATTTGCGAG CCTGATTTTG GATGAAAGGG AGAATGATCG GGGGAAAGGT 



AAAATGGAGC AGCAGAGATG 



AGGCTGCCTG GGCGCAGAGG CTCACGTCTA 



TAATCCCAGG CTGAGATGGC CGAGATGGGA GAATTGCTTG AGCCCTGGAG 



Oo ajot 



1 * . 
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PAGE: 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/029 , 042A 



DATE: 10/28/1999 
TIME: 14:00:56 



E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 

E--> 

W--> 



E--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 



40 

41 

42 

43 

44 

45 

46 

47 

46 

49 

50 

51 

52 

53 

54 

55 

56 

57 

58 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 



76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 



<210> 2 
<211> 1582 
<212> DNA 
<213> er ythropoietin Q_ 

^^ATGGGGGTGC ACGAATGTCC TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT 
GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT 

GTGACAGCCG AGTCCTGGAG AGGTACCTCT GGAGGCCAAG GAGGCCGAG 

AATATCACGG TGAGACCCCT TCCCCAGCAC ATTCCACAGA ACTCACGCTC 

AGGGCTTCAG GcQttCTCCT CCCAG^TCC AGGAACCTGG CACTTGGTTT 



Input Set: I029042A.RAW 
GTTCAGACCA ACCTAGGCAG CATAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGAAGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGAGGCTGCA 
GTGAGCTGTG ATCACACCAC TGCACTCCAG CCTCAGTGAC AGAGTGAGGC 
CCTGTCTCAA AAAAGAAAAG AAAAAAGAAA AATAATGAGG GCTGTATGGA 
ATACATTCAT TATTCATTCA CTCACTCACT CACTCATTCA TTCATTCATT 
CATTCAACAA GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTTG 
GGGCTGCTGA GGGGCAGGAG GGAGAGGGTG ACATGGGTCA GCTGACTCCC 
AGAGTCCACT CCCTGTAGGT CGGGCAGCAG GCCGTAGAAG TCTGGCAGGG 
CCTGGCCCTG CTGTCGGAAG CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
ACTCTTCCCA GCCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 

AGTGGCCTTC GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGGT 

GAGTAGGAGC GGACACTTCT GCTTGCCCTT TCTGTAAGAA GGGGAGAAGG 

GTCTTGCTAA GGAGTACAGG AACTGTCCGT ATTCCTTCCC TTTCTGTGGC 

ACTGCAGCGA CCTCCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 

CCAGATGCGG CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGACACTTT 

CCGCAAACTC TTCCGAGTCT ACTCCAATTT CCTCCGGGGA AAGCTGAAGC 

TGTACACAGG GGAGGCCTGC AGGACAGGGG ACAGATGA 
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B1M oirOTTENCE LISTING DATE : 10/28/1999 

PAGE : 3 ^f A ^L?CA"o™US/09/029,042A TIME : 14:00:56 

Input Set: I029042A.RAW 

GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCT 
GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 
tcctacQgcc tgtgcQccag GGCCAG^GC CTTCAGGGAC CCTTGACTCC 
CCGGGCTGTG TGCATTTCAG ACGGGCTGTG CTGAACACTG CAGCTTGAAT 
GAGAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 
GATGGAGGTG AGTTCCTTTT TTTTTTTTTT TCCTTTCTTT TGGAGAATCT 
CATTTGCGAG CCTGATTTTG GATGAAAGGG AGAATGATCG AGGGAAAGGT 
AAAATGGAGC AGCAGAGATG AGGCTGCCTG GGCGCAGAGG CTCACGTCTA 
TAATCCCAGG CTGAGATGGC CGAGATGGGA GAATTGCTTG AGCCCTGGAG 
GTTCAGACCA ACCTAGGCAG CATAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGAAGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGAGGCTGCA 
GTGAGCTGTG ATCACACCAC TGCACTCCAG CCTCAGTGAC AGAGTGAGGC 
CCTGTCTCAA AAAAGAAAAG AAAAAAGAAA AATAATGAGG GCTGTATGGA 
ATACGTTCAT TATTCATTCA CTCACTCACT CACTCATTCA TTCATTCATT 
CATTCAACAT GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTTG 
AGAGTCCACT CCCTGTAGGT CGGGCAGCAG GCCGTAGAAG TCTGGCAGGG 
CCTGGCCCTG CTGTCGGAAG CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
ACTCTTCCCA GCCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 
AGTGGCCTTC GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGGT 
GAGTAGGAGC GGACACTTCT GCTTGCCCTT TCTGTAAGAA GGGGAGAAGG 
GTCTTGCTAA GGAGTACAGG AACTGTCCGT ATTCCTTCCC TTTCTGTGGC 
ACTGCAGCGA CCTCCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 
CCAGATGCGG CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGACACTTT 
CCGCAAACTC TTCCGAGTCT ACTCCAATTT CCTCCGGGGA AAGCTGAAGC 



E--> 89 

W--> 90 

E--> 91 

W--> 92 

E--> 93 

W--> 94 

E--> 95 

W--> 96 

E--> 97 

W--> 98 

E--> 99 

W--> 100 

E--> 101 

W--> 102 

E--> 103 

W--> 104 

E--> 105 

W--> 106 

E--> 107 

W--> 108 

E--> 109 

W--> HO 

E--> HI 

W--> 112 

E--> 113 

W--> 114 

E--> 115 

W--> 116 

E--> 117 

W--> 118 

E--> 119 

W--> 120 

E--> 121 

W--> 122 

E--> 123 

W--> 124 

E--> 125 

W--> 126 

E--> 127 

W--> 128 

E--> 129 

W--> 130 

E--> 131 

W--> 132 

E--> 133 

W--> 134 
E--> 135 
W--> 136 
E--> 137 
W--> 138 
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Input Set: I029042A.RAW 



E--> 139 
W--> 140 



TGTACACAGG GGAGGCCTGC AGGACAGGGG ACAGATGA 



144 

E--> 145 

E--> 146 

W--> 147 

E--> 148 

W--> 149 

E--> 150 

W--> 151 

E--> 152 

W--> 153 

E--> 154 

W--> 155 

E--> 156 

W--> 157 

E--> 158 

W--> 159 

E--> 160 

W--> 161 

E--> 162 

W--> 163 

E--> 164 

W--> 165 

E--> 166 

W--> 167 

E--> 168 

W--> 169 

E--> 170 

W--> 171 

E--> 172 

W--> 173 

E--> 174 

W--> 175 

E--> 176 

W--> 177 

E--> 178 

W--> 179 

E--> 180 

W--> 181 

E--> 182 

W--> 183 

E--> 184 

W--> 185 

E--> 186 

W--> 187 



143 <212> DNA - 

gSH^gg^SHAcC TGCCTGGCTG TGGCTTCTCC TGTCCCTGCT 
' ' OTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT 



GTGACAGCCG AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG 
AATATCACGG TGAGACCCCT TCCCCAGCAC ATTCCACAGA ACTCACGCTC 
AGGGCTTCAG GC0ACTCCT CCCAC^TCC AGGAACCTGG CACTTGGTTT 
GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCT 
GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 
TCCTACGGCC TGTGGGCCAG GGCCAA&AC CTTCAGGGAC CCTTGACTCC 
CCGGGCTGTT TGCATTTCAG ACGGGCTGTG CTGAACACTG CAGCTTGAAT 
GAAAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 
GATGGAGGTG AGTTCCTTTT TTTTTTTTTT TCCTTTCTTT TGGAGAATCT 
CATTTGCGAG CCTGATTTTG GATGAAAGGG AGAATGATCG AGGGAAAGGT 
AAAATGGAGC AGCAGAGATG AGGCTGCCTG GGCGCAGAGG CTCACGTCTA 
TAATCCCAGG CTGAGATGGC CGAAATGGGA GAATTGCTTG AGCCCTGGAG 
GTTCAGACCA ACCTAGGCAG CATAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGATGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGAGGCTGCA 
GTGAGCTGTG ATCACACCAC TGCACTCCAG CCTCAGTGAC AGAATGAGGC 
CCTGTCTCAA AAAAGAAAAG AAAAAAGAAA AATAATGAGG GCTGTATGGA 
ATACATTCAT TATTCATTCA CTCACTCACT CACTCATCCA TTCATTCATT 
CATTCAACAA GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTCG 
GGGCTGCTGA GGGGCAGGAG GGAGAGGGTG ACATGGGTCA GCTGACTCCC 
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*AW SEQUENCE LISTING DATE : 10/28/1999 

PAGE : 5 ^J E ^ TIME : 14:00:56 

Input Set: I029042A.RAW 

AGAGTCCACT CCCTGTAGGT CGGGCAACAG GCCGTAGAAG TCTGGCAGGG 
CCTGGCCCTG CTGTCGGAAG CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
ACTTTTCCCA GCCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 
AGTGGCCTTC GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGGT 
GAGTAGGAGC GGACACTTCT GCTTGCCCTT TCTGTAAGAA GGGGAGAAGG 
GTCTTGCTAA GGAGTACAGG AACTGTCCGT ATTCCTTCCC TTTCTGTGGC 
ACTGCAGCGA CCTCCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 
CCAGATGCGG CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGACACTTT 
CCGCAAACTC TTCCGAGTCT ACTCCAATTT CCTCCGGGGA AAGCTGAAGC 
TGTACACAGG GGAGGCCTGC AGGACAGGGG ACAGATGA 



B--> 188 

W--> 189 

E--> 190 

W--> 191 

E--> 192 

W--> 193 

E--> 194 

W--> 195 

E--> 196 

W--> 197 

E--> 198 

W--> 199 

E--> 200 

W--> 201 

E--> 202 

W--> 203 

E--> 204 

W--> 205 

E--> 206 

W--> 207 



208 <210> 4 

209 <211> 1585 

210 <212> DNA 



211 

E--> 212 

E--> 213 

W--> 214 

E--> 215 

W--> 216 

E--> 217 

W--> 218 

E--> 219 

W--> 220 

E--> 221 

W--> 222 

E--> 223 

W--> 224 

E--> 225 

W--> 226 

E--> 227 

W--> 228 

E--> 229 

W--> 230 

E--> 231 

W--> 232 

E--> 233 

W--> 234 

E--> 235 

W--> 236 



<212> DNA . / 

^ KSS e ^a TGCCTGGCTG TGGCTTCTCC = 
GTCGCTCCCT CTGGGCCTCC CAGTCCTGGG CGCCCCACCA CGCCTCATCT 

GTGACAGCCG AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG 

AATATCACGG TGAGACCCCT TCCCCAGCAC ATTCCACAGA ACTCACGCTC 

AGGGCTTCAG GG^ACTCCT CCCAgQaTCC AGGAACCTGG CACTTGGTTT 

GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCT 

GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 

TCCTACGGCC TGTGGGCCAG GGCCAGGAGC CTTCAGGGAC CCTTGACTCC 

CCGGGCTGTG TGCATTTCAG AAGGGCTGTG CTGAACACTG CAGCTTGAAT 

GAGAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 

GATGGAGGTG AGTTCCTTTT TTTTTTTTTT TCCTTTCTTT TGGAGAATCT 

CATTTGCGAG CCTGATTTTG GATGAAAGGG AGAGTGATCG AGGGAAAGGT 

AAAATGGAGC AGCAGAGATG AGGCTGCCTG GGCGCAGAGG CTCACGTCTA 
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E--> 237 
W--> 238 
239 



3--> 



Input Set: I029042A.RAW 

TAATCCCAGG CTGAGATGGC CGAGATGGGA GAATTGCTTG AGCCCTGGAG 
GTTCAGACCA ACCTAGGCAG CATAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGAAGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGAGGCTGCG 
GTGAGCTGTG ATCACACCAC TGCACTCCAG CCTCAGTGAC AGAGTGAGGC 
CCTGTCTCAA AAAAGAAAAG AAAAAAGAAA AATAATGAGG GCTGTATGGA 
ATACATTCAT TATTCATTCA CTCACTCACT CACTCATTCA TTCATTCATT 
CATTCAACAA GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTTG 
GGGCTGCTGA GGGGCAGGAG GGTGAGGGTG ACATGGGTCA GCTGACTCCC 
AGAGTCCACT CCCTGTTGGT CGGGCAGCAG GCCGTAGAAG TCTGGCAGGG 
CCTGGCCCTG CTGTCGGAAG CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
ACTCTTCCCA GCCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 
AGTGGCCTTC GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGGT 
GAGTAGGAGC GGACACTTCT GCTTGCCCTT TCTGTAAGAA GGGGAGAAGG 
GTCTTGCTAA GGAGTACAGG AACTGTCCGT ATTCCTTCCC TTTCTGTGGC 
ACTGCAGCGA CCTCCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 
CCAGATGCGG CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGACACTTT 
CCGCAAACTC TTCCGAGTCT ACTCCAATTT CCTCCGGGGA AAGCTGAAGC 
TGTACACAGG GGAGGCCTGC AGGACAGGGG ACAGATGA 



W--> 240 

E--> 241 

W--> 242 

E--> 243 

W--> 244 

E--> 245 

W--> 246 

E--> 247 

W--> 248 

E--> 249 

W--> 250 

E--> 251 

W--> 252 

E--> 253 

W--> 254 

E--> 255 

W--> 256 

E--> 257 

W--> 258 

E--> 259 

W--> 260 

E--> 261 

W--> 262 

E--> 263 

W--> 264 

E--> 265 

W--> 266 

E--> 267 

W--> 268 

E--> 269 

W--> 270 

E--> 271 

W--> 272 

E--> 273 

W--> 274 



278 

E--> 279 

E--> 280 

W--> 281 

E--> 282 

W--> 283 

E--> 284 

W--> 285 



275 <210> 5 

276 <211> 1583 If 

277 <212> DNA ^ J]^ 



(fti l Igocl^ cc TG cct GG ct/; gG cttctcc TG tccct G ct 
^!!^ ^CQCTCCCT CTCWCCCTCC ca G tcct GGG cqccccacca cqcctcatct 

QTQACAQACO AQTCCTQOAQ AQOTACCTCT T GG A GG CCAA QQAQQCCQAQ 
AATATCACOQ TGAGACCCCT TCCCCAOCAC ATTCCACAQA ACTCAC6CTC 
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ewnnirorF LISTING DATE: 10/28/1999 

7 SSsSFESScSES OS/09/029. 042A TIME : 14:00:56 

Input Set: I029042A.RAW 

AGGGCTTCAG GgQ^ACTCCT CCCAG^TCC AGGAACCTGG CACTTGGTTT 
GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCT 
GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 
TCCTACGGCC TGTGGCCCAG GGGCaQgAGC CTTCAGGGAC CCTTGACTCC 
CCGGGCTGTG TGCATTTCAG ACGGGCTGTG CTGAACACTG CAGCTTGAAT 
GAGAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 
GATGGAGGTG AGTTCCTTTT TTTTTTTlQ TCCTTTCTTT TGGAGAATCT 
CATTTGCGAG CCTGATTTTG GATGAAAGGG AGAATGATCG AGGGAAAGGT 
AAAATGGAGC AGCAGAGATG AGGCTGCCTG GGCGCAGAGG CTCACGTCTA 
TAATCCCAGG CTGAGACGGC CGAGATGGGA GAATTGCTTG AGCCCTGGAG 
GTTCAGACCA ACCTAGGCAG CATAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGAAGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGGGGCTGCA 
GTGAGCTGTG ATCACACCAC TGCAATCCAG CCTCAGTGAC AGAGTGAGGC 
CCTGTCTCAA AAACGAAAAG AAAAAAGAAA AATAATGAGG GCTGTATGGA 
ATACATTCAT TATTCATTCA CTCACTCACT CACTCATTCA TTCATTCATT 
CATTCAACAA GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTTG 
GGCCTTCTGA GGGGCAGGAG GGAGAGGGTG ACATGGGTCA GCTGACTCCC 
AGAGTCCACT CCCTGTAGGT CGGGCAGCAG GCCGTAGAAG TCTGGCAGGG 
CCTGGCCCTG CTGTCGGAAT CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
ACTCTTCCCA ACCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 
AGTGGCCTTC GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGTT 
GAGTAGGAGG GGACACTTCT GCTTGCCCTT TGTGTAAGAA GGAGAGAAGG 
GTCTTGCTAA GGAGTACAGG AACTGTCCGT ATTCCTTCCC TTTCTGTGGC 
ACTGCAGCGA CCTCCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 



E--> 286 

W--> 287 

E--> 288 

W--> 289 

E--> 290 

W--> 291 

E--> 292 

W--> 293 

E--> 294 

W--> 295 

E--> 296 

W--> 297 

E--> 298 

W--> 299 

E--> 300 

W--> 301 

E--> 302 

W--> 303 

E--> 304 

W--> 305 

E--> 306 

W--> 307 

E--> 308 

W--> 309 

E--> 310 

W--> 311 

E--> 312 

W--> 313 

E--> 314 

W--> 315 
E--> 316 
W--> 317 
E--> 318 
W--> 319 
E--> 320 
W--> 321 
E--> 322 
W--> 323 
E--> 324 
W--> 325 
E--> 326 
W--> 327 
E--> 328 
W--> 329 
E--> 330 
W--> 331 
E--> 332 
W--> 333 
E--> 334 
W--> 335 
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PAGE: B ^? Q S™U S /09/0 2 9.042A ?S5! "M" 

input Set: I029042A.RAW 



E--> 336 

W--> 337 

E--> 338 

W--> 339 

E--> 340 

W--> 341 



CCAGATGCGG CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGATACTTT 
CCGCAAACTC TTCCGAGTCT ACTCCAATTT CCTCCGGGGA AAGCTGAAGC 
TGTACACAGG GGAGGCCTGC AGGACAGGGG ACAGATGA 



342 <210> 6 

343 <211> 1586 

344 <212> DNA 



<212> UNA , 

= = = 

OTGACAGCCG AGTCCTGGAG AGGTACCTCT TGGAGGCCAA GGAGGCCGAG 
AATATCACGG TGAGACCCCT TCCCCAGCAC ATTCCACAGA ACTCACGCTC 
AGGGCTTCAG GGGAACTCCT CCCAGGATCC AGGAACCTGG CACTTGGTTT 
GGGGTGGAGT TGGGAAGCTA GACACTGCCC CCCTACATAA GAATAAGTCX 
GGTGGCCCCA AACCATACCT GGAAACTAGG CAAGGAGCAA AGCCAGCAGA 
TCCTACGGCC TGTGGCCCAG GGCCA^AGC CTTCAGGGAC CCTTGACTCC 
I'-"-* 36? CCGGGCTGTG TGCATTCCAG ACGGGCTGTG CTGAACACTG CAGCTTGAAT 

GAGAATATCA CTGTCCCAGA CACCAAAGTT AATTTCTATG CCTGGAAGAG 
GATGGAGGTG AGTTCCTTTT TTTTTTTTTT TCCTTTCTTT TGGAGAATCT 
CATTTGCGAG CCTGATTTGG GATGAAAGGG AGAATGATCG AGGGAAAGGT 
AAAATGGAGC AGCAGAGATG AGGCTGCCTG GGCGCAGAGG CTCCAGTCTA 
TAATCCCAGG CTGAGATGGC CGAGATGGGA GAATTGCTTG AGCCCTGGAG 
GTTCAGACCA ACCTAGGCAG CCTAGTGAGA TCCCCCATCT CTACAAACAT 
TTAAAAAAAT TAGTCAGGTG AAGTGGTGCA TGGTGGTAGT CCCAGATATT 
TGGAAGGCTG AGGCGGGAGG ATCGCTTGAG CCCAGGAATT TGAGGCTGCA 
GTGAGCTGTG ATCACACCAC TGCACTCCAG CCTCAGTGAC AGAGTGAGGC 
CCTGTCTCAA AAAAGAAAAG AAAAAAGAAA AATTATGAGG GCTGTATGGA 
ATACATTCAT TATTCATTCA CTCACTCACT CACTCATTCA TTCATTCATT 



345 

B--> 346 

E--> 347 

W--> 348 

E--> 349 

W--> 350 

E--> 351 

W--> 352 

E--> 353 

W--> 354 

E--> 355 

W--> 356 

E--> 357 

W--> 358 

E--> 359 

W--> 360 

E--> 361 

W--> 362 

E--> 363 

W--> 364 

E--> 365 

W--> 366 

E--> 367 

W--> 368 

E--> 369 

W--> 370 

E--> 371 

W--> 372 

B — > 373 

W--> 374 

E--> 375 

W--> 376 

E--> 377 

W--> 378 

E--> 379 

W--> 380 

E--> 381 

W--> 382 

E--> 383 

W--> 384 
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PAGE : 



E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 
E--> 
W--> 



385 

386 

387 

388 

389 

390 

391 

392 

393 

394 

395 

396 

397 

398 

399 

400 

401 

402 

403 

404 

405 

406 

407 

408 



409 
411 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



412 

413 

414 

415 

416 

417 

418 

419 

420 

421 

422 

423 

424 

425 

426 

427 

428 

429 

430 

431 

432 

433 



RAW SEQUENCE LISTING 
PATENT APPLICATION US/09/029 , 042A 



DATE: 10/28/1999 
TIME: 14:00:56 



CATTCAACAA 
GGGCTGCTGA 
AGAGTCCACT 
CCTGGCCCTG 
ACTCTTCCCA 
AGTGGCCTTC 
GAGTAGGAGC 
GTCTTGCTAA 
ACTGCAGCGA 
CCAGATGCGG 
CCGCAAACTC 
TGTACACAGG 



Input Set: I029042A.RAW 

GTCTTATTGC ATACCTTCTG TTTGCTCAGC TTGGTGCTTG 
GGGGCAGGAG GGAGAGGGTG ACATGGGITC ACTGACTCCC 
CCCTGTAGGT GGGGCAGGAG GCCGTAGAAG TCTGGCAGGG 
CTGTCGGAAG CTGTCCTGCG GGGCCAGGCC CTGTTGGTCA 
GCCGTGGGAG CCCCTGCAGC TGCATGTGGA TAAAGCCGTC 
GCAGCCTCAC CACTCTGCTT CGGGCTCTGG GAGCCCAGGT 
GGACACTTCT GCTTGCCCTT TCTGTAAGAA GGGGAGAAGG 
GGAGTACAGG ATCTGTCCGT ATTCCTTCCC TTTCTGTGGC 
CCACCTGTTT TCTCCTTGGC AGAAGGAAGC CATCTCCCCT 
CCTCAGCTGC TCCACTCCGA ACAATCACTG CTGACACTTT 
TTCCGAGTCT ACTCCAATTT CCTCCGGGGA GAGCTGAAGC 
GGAGGCCTGC AGGACAGGGG ACGGATGA 



Jo .Jf iUJ ^ ^ ^ h 



<210> 7 
<211> 193 

<212> PRT O ^ , - - 

1 1 i.ii , r^rr?pnirri" fT np — 9 1 ^, LBU Trp Leu Leu Leu Ser Leu 

(^f^TpTl" s g £ & £ - r ,iVL 

Ile cys Asp ser Ar g V* - «u «. Leu Leu Ou A!a Lys ^ 



, „ \ 1a n,i His Cvs Ser Leu Asn Glu Asn lie 

Asn lie Thr Thr Gly Cys Ala Glu Hxs Cys f^JU^ 

Pro w Glu Leu His val Asp Lys Ala val ser oly Leu Arg ser Leu ^ ^ ^ 

Th r Thr Leu Leu Ar g Ala Leu sly ,1a Gin Lys Glu »U .1. ser Pro ^ ^ 



Th r v.! Pro Asp Thr Lya v.l As„ Phe Tyr Ala Trp Lys Ar g Met Glu 

val GlvGlu «- <*« Val TtP ^ 

Glu Ala 

-^^U, Gly Glu Ala Leu Leu Val Asn Ser Ser Glu Pro Trp Glu 
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10 



RAW SEQUENCE LISTING 

MENT APPLICATION US/09/02 9 , 042A 



DATE: 10/28/1999 
TIME: 14:00:56 



Input Set: I029042A.RAW 



E--> 



434 
435 
436 
437 
438 
439 
440 
441 



442 
443 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



pro Asp 

^I^Tser Ala Ala Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg 
Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 



E--> 



E--> 



444 
445 
446 
447 
448 
449 
450 
451 
452 
453 
454 
455 
456 
457 
458 
459 
460 
461 
462 
463 
464 
465 
466 
467 
468 
469 
470 
471 
472 
473 
474 



<210> 8 

<211> 193 

<212> PRT 

^H-hrn poie Hn aene ^ 

<3ooV)Met Gly Val His Glu Cys 

Leu Ser Leu Pro Leu Gly 

He Cys Asp Ser Arg Val 
Ala Glu 



2 



Pro Ala 

Leu Pro 



Trp Leu Trp Leu Leu Leu Ser Leu 

Val Leu Gly Ala Pro Pro Arg Leu 



■fit 



E--> 



475 
476 
477 
478 
479 
480 
> 481 



Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 




Asn He Thr Thr Gly Cys 
Thr Val Pro Asp Thr Lys 
Val Gly Gin Gin Ala Val 
Glu Al 



Val Leu Arg Gly Gin Ala 
Pro Leu Gin Leu His Val 
Thr Thr Leu Leu Arg Ala 
^ro^Asp^ 
Ala Ala Ser Ala Ala Pro 
Lys Leu Phe Arg Val Val 
Tyr Thr Gly Glu Ala Cys 



Ala Glu His Cys Ser Leu Asn Glu Asn He 
Val Asn Phe Tyr Ala Trp Lys Arg Met Glu 
Glu Val Trp Gin Gly Leu Ala Leu Leu Ser 

Val Asn Ser Ser -Gin Pro Trp Glu 
Ala val Ser Gly Leu Arg Ser Leu 
Ala Gin Lys Glu Ala He Ser Pro 




Leu Leu 
Asp Lys 
Leu Gly 



Leu Arg Thr He Thr Ala Asp Thr Phe Arg 
Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu 
Arg Thr Gly Asp Arg 



<210> 9 
<211> 193 
<212> PRT 

« ai?> erythr opoietin gene yj Leu Leu ger Leu 
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11 



E--> 



E--> 



E--> 



3--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



482 

483 

484 

485 

486 

487 

488 

489 

490 

491 

492 

493 

494 

495 

496 

497 

498 

499 

500 

501 

502 

503 

504 

505 

506 



507 
509 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



E--> 



510 

511 

512 

513 

514 

515 

516 

517 

518 

519 

520 

521 

522 

523 

524 

525 

526 

527 

528 

529 

530 



,» SEQUENCE "ST™ ™*> ^^f 9 

PATENT APPLICATION US/09/029 , 

Input Set: I029042A.RAW 

'Ala Glu 

Asn lie Thr Lys Gly Cys Ala Glu His Cys Ser Leu Asn Glu Asn He 

Val Gly Gin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser 

Glu Ala 

Pro Leu Glu Leu Hie val Asp Lye Ala v.! ser oly Leu Arg ser Leu 
Thr Thr Leu Leu Arg Ala Leu Sly Ala Gin Lys 01. Ala He Ser- Pro 

^Pro Asp/ 

"H^a Ser Ala Ala Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg 
Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu 
Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 



<210> 10 
<211> 193 

<212> PRT , 

r ^ni.Hn aene v> /Q ^ ^ ger Leu . 

S « - Leu £ Ala « «, at, Leu 
He cys Asp Arg Arg val Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 

Ala Glu 

A sn lie Thr Thr Gly Cys Ala Glu His Cys Ser Leu Asn Glu Asn He 
Thr Val Pro Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg Met Glu 
Val Gly_Gln Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser 

Glu Ser 

-^r^Arg Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu 
Th r Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala He Ser Pro 
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Input Set: I02 9042A.RAW 



E--> 


531 




532 


E--> 


533 




534 


E--> 


535 




536 


E--> 


537 




538 


E--> 


539 




540 


541 




542 




543 


E--> 


544 




545 


E--> 


546 




547 


E--> 


548 




549 


E--> 


550 




551 


E--> 


552 




553 


E--> 


554 




555 


E--> 


556 




557 


E--> 


558 




559 


E--> 


560 




561 


E--> 


562 




563 


E--> 


564 




565 


E--> 


566 




567 


E--> 


568 




569 


E--> 


570 




571 


E--> 


572 




Pro Asp 



Ala Ala Ser Ala Ala Pro Leu Arg Thr lie Thr Ala Asp Thr Phe Arg 
Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Lys Leu Lys Leu 
Tyr Thr Gly Glu Ala Cys Arg Thr Gly Asp Arg 



<210> 11 
<211> 193 
<212> PRT 

^^^^T^UL Ala Trp Leu Trp Leu Leu Leu Ser Leu 
^^STS ^ Pro Leu G^y Leu Pro Val Leu Gly Ala Pro Pro Arg Leu 

lie Cys Asp Ser Arg Val Leu Glu Arg Tyr Leu Leu Glu Ala Lys Glu 
Ala Glu 

AS n lie Thr Thr Gly Cys Ala Glu His Cys Ser Leu Asn Glu Asn He 
Thr Val Pro Asp Thr Lys Val Asn Phe Tyr Ala Trp Lys Arg Met Glu 
Val Gly Gin Gin Ala Val Glu Val Trp Gin <3ly Leu Ala Leu Leu Ser 
Glu Ala 

Val Leu Arg Gly Gin Ala Leu Leu Val Asn Ser Ser Gin Pro Trp Glu 
Pro Leu Gin Leu His Val Asp Lys Ala Val Ser Gly Leu Arg Ser Leu 
Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Lys Glu Ala He Ser Pro 

Pro Asp 

Ala Ala Ser Ala Ala Pro Leu Arg Thr He Thr Ala Asp Thr Phe Arg 
Lys Leu Phe Arg Val Tyr Ser Asn Phe Leu Arg Gly Glu Leu Lys Leu 
Thr Gly Glu Ala Cys Arg Thr Gly Asp Gly 

^ Please review the 

Seauence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> ^eWs of each sequence which presents at least one n or Xaa. 




Fede ral Register/ Vol. 63, No. 104/Monday. June 1, 1998/Rules a nd Regulations 
^pendlx A To Subpart C to Part 1-Sample Sec.^^e listing 
<110> Smith, John 



29639 



Smith, Jane 



<120> Example of a Sequence Listing 
<X30> 01-00001 

\ H 

<140> US 08/999,999 
<141> 1998-02-28 



<150> EP 91000000 
<1S1> 1997-12-31 



Feder al Rc^/Vol- 63. No. '™ 19 J| /RuleS and RegUlatl0nS 



<160> 2 



<170> Patentln vcr. 2.0 

<210> 1 
<211> 403 

<212> DNA \ \ 

i, 

* Hi \ . 

<213> Paramecium aur£lia 



<220> 
<221> CDS 
<222> 341. .394 



<300> 



<301> Doe, Richard 

<302> isolation and Characterization of a Gen. Encoding 

Protease from Paramecium sp. 
<303> Journal of Fictional Genes 



Federal Jfcglster/Vol. 63. No. 101 'Hnnrlir '""8/Rules and Regulations 296' 



<304> 1 



<y.os> 4 



<306> 1-7 



<307> 19BB-06-20 



<400> 1 



ctactctact ctactctcat\ctactatctt ctttggatct ctgagtctgc ctgagtggta 6 0 



ctcttgagtc ctggagatct ctcctctcac atgtgatcgt cgagactgac cgatagatcg 120 



:tgactgact ctgagatagt cgagcccgta cgagacccgt cgagggtgac agagagtggg 180 



cgcgtgcgcg cagagcgccg cgccggtgcg cgcgcgagtg cgcggtgggc cgcgcgaggg 240 



rtttcgcggc agcggcggcg ctttccggcg cgcgcccgtc cgcccctaga cctgagaggt 300 



cttctcttcc ctcctcttca ctagagaggt ctatatatac atg gtt tea atg ttc 

Met Val Ser Met Phe 



355 



age ttg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgtttgctc 4 03 

Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val 
" 10 15 

\ \ 

<210> 2 v 

V 

<211> 18 
<212> PRT 

<213> Paramecium aurelia 
<400> 2 



Federal Renter/ Vol- 63, No. 104/Monday. June 1^98/Rules and Regulation^ 



Ce^^ 



29643 



Met Val Ser 



Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 



10 



15 



Phe Val 



cd: May 22. 1998. 
• A. Lehman, 

ant Secretary of Commerce and 
ilssloner of Patents and Trademarks. 
oc. 98-14194 Filed 5-29-98; 8:45 am) 
i code asio-ie-c 

i. 



M RtfV'irements for Applications - OG Date: 23 June 1998 



http://vwwuspto.gov/web/offiees/coni/sol/og/1998/week25/patreqr.htm 



« ^company xng xi.i.orma..xoii as snown x.i Lne xoxxowxny 
xuencxxxexs a.»u u.eu shall be used only in the "Sequence 

table. The numeric identifier shall oe u y information in the 

Listing." The order and presentation of the ^ems or ^ 
-Sequence Listing" shall conform to ^ "f™ g ^ h £l be gin with the 
item of information shall begin on a ™*JlU* and shall g ^ 
numeric identifier enclosed « •"g-^SlS an "M""s mandatory. The 
of those it^^^^^iS^S designated with an "0" is 
submission, of those items « 1 through <170> shall only be set 

illustrates the numeric identifiers. 



Numeric 
Identifier 

<110> 



Definition 



Applicant 



Comments and 
Format 

'Preferably max. 

of 10 names; 

one name per line; 

preferable format: 
> SurnWe, Other 
* Names, and/or 

Initials 



Mandatory (M) or 
Optional .(0) 

M 



<120> 



<130> 



<140> 



<141> 



<150> 



Title of 
Invention 

File Reference 



M 



Personal file 
reference 



Current Applica- Specify as: 
tion Number ' US 07/999,999 or 
PCT/US96/99999 



Current Filing 
Date 



Specify as: yyyy 



M when filed prior 
to assignment of 
appl. number 

M, if available 



-mm-dd. M, if available 



Prior Application Specify as: 
Number US 07/999,999 or 

PCT/US96/99999 



M, if applicable 
include priority 
documents under 
35 USC 119 and 
120 



<151> 



<160> 



Prior Application Specify as: 
Filing Date 



yyyy 



-mm-dd M, if applicable 



Number of SEQ ID Count includes 
NOs total number of 

SEQ ID NOs 



M 



<170> 



Software 



Name of software used 
to create the 
Sequence Listing 



<210> 



SEQ ID NO: ft : 



Response shall be an 
integer repre- 
senting the SEQ 
ID NO shown 



<211> 



Length 



Respond with an integer M 
expressing the number 
of bases or amino acid 
residues 



28 of 34 
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<212> 



Type 



<213> 



Organism 



<220> 



Feature 



<221> 



Name/Key 



<222> 



Location 



Whether presented 
sequence mole- 
cule is DMA, 
RNA, or PRT 
(protein) . If 
a nucleotide 
sequence con- 
tains both DNA 
and RNA frag- 
ments, the 
type shall be 
"DNA. " In ad- 
dition, the 
combined DNA/ 
j*NA molecule 
shall be further 
described in 
the 1 <220> to 
<223> feature 
v section. 

Scientific name, 
i.e. Genus/species, 
Unknown or Artifi- 
cial Sequence. In 
addition, the 
"Unknown" or 
"Artificial Se- 
quence" organisms 
shall be further 
described in the 
<220> to <223> 
feature section. 

Leave blank after 
<220>. <221-223> 
provide for a 
description of 
points of bio- 
logical signi- 
ficance in the 
sequence . 



.. US pto.BOvA«b/officcs/coiH/sol/og/l998/wcck25/patrcqr.htni 



Provide appropriate 
identifier for 
feature, pre- 
ferably from 
WIPO Standard 
ST. 25 (1998), 
Appendix 2, 
Tables 5 and 6 

Specify location 
within sequence; 
where appropriate 
state number of 
first and last 
bases/amino acids 



M 



M 



M, under the 
following condi- 
tions: if "n, " 
"Xaa, " or a mod- 
ified or unusual 
L-amino acid or 
modified base was 
used in a se- 
quence; if ORGAN- 
ISM is "Artifi- 
cial Sequence" or 
"Unknown"; if 
molecule is 
combined DNA/ RNA. 

M, under the fol- 
lowing conditions: 
if "n," "Xaa," or 
a modified or un- 
usual L-amino 
acid or modified 
base was used in 
a sequence 

M, under the fol- 
lowing conditions: 
if "n," "Xaa," or 
a modified or un- 
usual L-amino 
acid or modified 
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<223> 



Other Infor- 
mation 



http://^^ v 



in feature 



Other relevant 

information; 

four lines maximum 



uspto.gov/wcb/omccs/coni/sol/og/l998/wcck25/patrcqr.htm 



base was used in 
a sequence 

M, under the fol- 
lowing conditions: 



if 



"Xaa," or 



a modified or un- 
usual L-amino acid 
or modified base 
was used in a 
sequence; if 
ORGANISM 
is "Artificial 
Sequence" or 
"Unknown"; if 
molecule is com- 
bined DNA/RNA. 



<300> 



<301> 



Publication 
Information 

Authors 



Leave blank 
v after, <300> 

Preferably max 
of teri named 
authors of publi- 
cation; specify 
one name per line; 
preferable format: 
Surname, Other 
Names and/or 
Initials 



<302> 
<303> 
<304> 
<305> 
<306> 
<307> 



<308> 



<309> 



<310> 



Title 

Journal 

Volume 

Issue 

Pages 

Date 



Database 

Accession 

Number 



Database Entry 
Date 



Patent Document 
Number 



Journal date on which 
data published; 
specify as yyyy-mm- 
dd, MMM-yyyy or 

Season-yyyy 

Accession number 
assigned by data- 
base including 
database name 

Date of entry in 
database; specify 
as yyyy-mm-dd or 
MMM-yyyy 

Document number; 
for patent-type 
citations only. 
Specify as, for 
example, US 
07/999, 999 



0 
O 



1/29/99 1:53 PM 
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httpV/wW usplo.gov/wcb/olTiccs/coiu/sol/og/l998/wcck25/patrcqr.htnj 

# 



<311> 



<312> 



Patent Filing 
Date 



Document filing 
date, for patent- 
type citations only; 
specify as yyyy-mm-dd 



Publication Date Document publication 
date, for 
patent-type 
citations only; 
specify as yyyy-mm-dd 



<313> 
<400> 



Relevant 
Residues 

Sequence 



M 



FROM (position) TO 
(position) 

SEQ ID NO should 
'follow the 
numeric identifier 
and should appear 
on tiie line pre- 
\ ceding the actual 
sequence 

5. Section 1.824 is revised to read as follows: 

1.824 Form and format for nucleotide and/or amino acid sequence 
submissions in computer readable form. 

(a, The computer readable form required by 1.821(e) shall meet the 
following specifications: 

m The computer readable form shall contain a single "Sequence Listing" 
il^ItterTSf^ette, series of diskettes, or other permissible medxa 
outlined in paragraph (c) of this section. 

(2) The "Sequence Listing" in paragraph (a) (1) of this section shall be 
submitted in American Standard Code for Information Interchange (ASCII 
text. No other formats shall be allowed. 

(3) The computer readable form may be created by any means, such as word 
processors, nucleotide/amino acid sequence editors or other custom 
computer^rograms; however, it shall conform to all specifications 
detailed in this section. 

(4) File compression is acceptable when using diskette media, so long as 
tie comprised file is in a self-extracting format that will decompress 
on one of the systems described in paragraph (b) of this section. 

(5) Page numbering shall not appear within the computer readable form 
version of the "Sequence Listing" file. 

(6) All computer readable forms shall have a label permanently affixed 
thereto on which has been hand-printed or typed: the name of the 
applicant, the title of the invention, the date on which the data were 

S'n the computer readable form, the operatin g s y Jtem «sed. a 
reference number, and an application serial number and filing date, if 
known . 

(b) Computer readable form submissions must meet these format 
requirements: 

(1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; 

(2) Operating System: MS-DOS, Unix or Macintosh; 
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